A spectral clustering method for microarray data
نویسندگان
چکیده
This paper considers a clustering method motivated by a multivariate analysis of variance model and computationally based on eigenanalysis (thus the term “spectral” in the title). Our focus is on large problems, and we present the method in the context of clustering genes using microarray expression data. We provide an e5cient computational algorithm and discuss its properties and interpretation in statistical and geometric terms. Leukemia and Melanoma data sets are analyzed to demonstrate the use of the method, and simulations are carried out to compare our method with two other clustering algorithms. We extend the method to enable supervision by either gene or array characteristics. c © 2004 Elsevier B.V. All rights reserved.
منابع مشابه
Modification of the Fast Global K-means Using a Fuzzy Relation with Application in Microarray Data Analysis
Recognizing genes with distinctive expression levels can help in prevention, diagnosis and treatment of the diseases at the genomic level. In this paper, fast Global k-means (fast GKM) is developed for clustering the gene expression datasets. Fast GKM is a significant improvement of the k-means clustering method. It is an incremental clustering method which starts with one cluster. Iteratively ...
متن کاملSegmentation of cDNA Microarray Images using Parallel Spectral Clustering
Microarray Image Microarray technology generates large amounts of expression level of genes to be analyzed simultaneously. This analysis implies microarray image segmentation to extract the quantitative information from spots. Spectral clustering is one of the most relevant unsupervised methods able to gather data without a priori information on shapes or locality. We propose and test on micro...
متن کاملSpectral Clustering Gene Ontology Terms to Group Genes by Function
With the invention of biotechnological high throughput methods like DNA microarrays, biologists are capable of producing huge amounts of data. During the analysis of such data the need for a grouping of the genes according to their biological function arises. In this paper, we propose a method that provides such a grouping. As functional information, we use Gene Ontology terms. Our method clust...
متن کاملHybrid Algorithm for Clustering of Microarray Data
Clustering is a crucial step in the analysis of gene expression data. Its goal is to identify the natural clusters and provide a reliable estimate of the number of distinct clusters in a given data set. In this paper we propose new hybrid algorithm for clustering of microarray data based on spectral clustering and k-means. Our algorithm consist of four steps, including preprocessing or filterin...
متن کاملبه کارگیری روشهای خوشهبندی در ریزآرایه DNA
Background: Microarray DNA technology has paved the way for investigators to expressed thousands of genes in a short time. Analysis of this big amount of raw data includes normalization, clustering and classification. The present study surveys the application of clustering technique in microarray DNA analysis. Materials and methods: We analyzed data of Van’t Veer et al study dealing with BRCA1...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Computational Statistics & Data Analysis
دوره 49 شماره
صفحات -
تاریخ انتشار 2005